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Abstract 

Being HIV-l-PR an essential enzyme in the viral life cycle, its inhibition can control AIDS. 
Because the folding of single domain proteins, like HIV-l-PR is controlled by local elementary 
structures (LES, folding units stabilized by strongly interacting, highly conserved amino acids) 
which have evolved over myriads of generations to recognize and strongly attract each other so 
as to make the protein fold fast, we suggest a novel type of HIV-l-PR inhibitors which interfere 
with the folding of the protein: short peptides displaying the same amino acid sequence of that of 
LES. Theoretical and experimental evidence for the specificity and efficiency of such inhibitors are 
presented. 
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I. INTRODUCTION 



HIV-l-PR is a homodimer, that is a protein whose native conformation is built of two 
(identical) disjoint chains each of them made of 99 amino acids. Sedimentation equilibrium 
experiments have shown that at neutral pH the protease folds according to a three-state 
mechanism {2U 2N — * N2), populating consistently the monomeric native conformation 
N The dimer dissociation constant (2A^ N2) is = 5.8 fiM at room temperature, 
while the folding temperature of the monomer, i.e. the temperature at which the free energy 
of the native monomeric state is equal to that of the unfolded state U is Tj = 52. 5C 2]- 

Monte Carlo simulations of the folding of three-state homodimers |3| indicate that after 
the monomers have reached the native state N, they diffuse to find another folded monomer 
to associate with. The same study also shows that in the stage U —>■ N, each monomer 
folds according to the same mechanism controlling the folding of single domain proteins. 
This mechanism is reasonably well understood as a result, among other things, of extensive 
lattice model studies as well as of all-atom off-latice Go-model simulations (cf. ref. jj| and 
refs. therein). These studies revealed the central role played by few, strongly interacting 
highly conserved, as a rule hydrophobic, amino acids. Mutations of these "hot" amino 
acids lead, in general, to denaturation p. It has been shown 0, Q] that, starting from 
an unfolded conformation, folding proceeds following a hierarchical succession of events: 1) 
formation of LES stabilized by the interaction among "hot" amino acids lying close along 
the polypeptide chain, 2) docking of the LES into the (post critical) folding nucleus, that is 
formation of the minimum set of local contacts which brings the system over the major free 
energy barrier of the whole folding process, 3) relaxation of the remaining amino acids on 
the native structure shortly after the formation of the folding nucleus. Strong support for 
the soundness of this hierarchical scenario is found in a number of circumstantial evidences 

Qy,yiii, 
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The same model ja, |2| suggests that it is possible to destabilize the native conformation 
of a protein with the help of peptides whose sequences are identical to those of the LES 
of the protein [l^. Such peptides (p-LES) interact with the protein (in particular with 
their complementary LES) with the same energy which stabilizes its folding nucleus, thus 
competing with its formation. 

One can mention two advantages of these non-conventional (folding-) inhibitors as com- 
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pared to conventional (active site centerd-) inhibitors: I) their molecular structure is direclty 
fixed by that of the target protein. One has thus not to design a molecule so as to optimize 
its interaction with a given site of the enzyme, but just find which are the LES of the protein 
to be inhibited. The design of the p-LES having been performed by evolution. In fact, like 
the pair enzyme/substrate, pairs of complementary LES have evolved through millions of 
years in order to learn how to recognize its partners as well as to avoid aggregation, II) 
it is unlikely that the protein can develop drug-resistance through mutations as LES are 
stabilized by hot amino acids. 

The present paper is divided in two main parts. A theoretical one (Sections 2 and 3) 
where the prediction of the model are presented, and an experimental one (Sect. 4), where 
the properties of the designed inhibitor are tested. 

II. FOLDING UNITS OF HIV 1 PR: LOCAL ELEMENTARY STRUCTURES 

The first step to be taken in the design of an inhibitor of the HIV-l-PR following the 
strategy discussed in the previous Section is to determine the LES of the target protein. In 
other words, segments of the protein containing highly conserved residues. To this scope use 
is made of evolution data input as well as model simulations. 

A. Evolution 

A measure of the degree of conservation of residues in a family of proteins is the entropy 
per site S{i) = — J2aPi('^) ^^Pii'^)^ where Pi(cr) is the frequency of appearence of residue of 
type 0" at site i. Making use of a family of 28 uncorrelated proteiris (i.e., displaying sequence 
similarity lower than 25%) structurally similar to HIV-l-PR |l7|, the entropy S{i) was 
calculated for all 99 sites of the monomers. From the result shown in Fig. ^a) (continous 
curve) it is observed that the most conserved regions of the proteins involve residues 24-33, 
56-61 and 81-87. Note that the conservation of residues 25-27 is not unexpected in that they 
build the active site of the protease. 

Another important source of information concerning protected sites of the HIV-l-PR 
stems from drug-induced mutations observed in vivo. The large production of virion in 
the cell, coupled with the error prone reverse transcriptase enzyme, eventually leads to 
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escape mutants. Mutations observed in the stretches of the HIV-l-PR mentioned above 
are, as a rule, conservative mutations (e.g. L24I) Q^Il^- Less conservative mutations, e.g. 
hydrophobic to hydrophihc, are absent from these structurally protected regions. 

B. Model calculations: energetics 

In order to become stable at an early stage of the folding process, LES must carry a 
significant fraction of the total energy of the protein. We have calculated this quantity 
making use of short all-atom molecular dynamics simulations in explicit water using GRO- 
MACS software The calculation was performed at 295 K for 1 ns, during which the 
protein fluctuates around its native conformation (the maximum RMSD being 11 A). We 

n 

have followed the scheme described in ref. j20|, calculating the average interaction energy 
Bij between any pair of amino acids during the Ins dynamics and extracting the eigenvalues 
A„ of the resulting matrix || Bij ||. The lowest energy state (Ai=-121.2 kJ/mol) displays a 
large energy gap (i.e. -12.9 kJ/mol~ 5kT) with respect to the next eigenvalue, indicating 
a core of strongly interacting amino acids [2^. We display in Fig. ^b) the eigenvector 
associated with the lowest energy eigenvalue, which highlights to which extent the different 
amino acids participate to this core. The largest amplitudes involve residues 25-32, 57-65, 
74-77 and 83-90. Regions 25-32, 57-65 and 83-90 overlap well with the conserved regions 
mentioned above in connection with Fig. ^^a). 

Another way of representing the interaction matrix || Bij \\ is to consider the interaction 
between fragments Si = (13 - 21), ^2 = (24 - 34), ^3 = (38 - 48), = (50 - 55), 
^5 = (56 - 66), ^6 = (67 - 72), ^7 = (74 - 78), Sg = (84 - 94). As can be seen from 
Table HJ the corresponding 8x8 energy map is essentially codiagonal, the associated energy 
of the interacting chain segment being —1751 kJ/mol as compared to the MD simulation 
native energy —2533 kJ/mol. In other words, the Si — Sg representation of the folded 
monomer accounts for ^ 70% of the calculated native conformation energy. In particular, 
the interaction between fragments 5*2, 5*5, 5*7 and S'g is stabilized by —796 kJ/mol equal to 
32% of the native conformation energy. Of this energy 20% (= —503 kJ/mol) corresponds 
to the internal energy of the four fragments, while 12% (= —293 kJ/mol) corresponds to 
the summed interaction energy between fragments. 

In keeping with the above results the 5*2, ^5, 5*7 and 5*8 segments qualify as LES (folding 
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units) of each of the two monomers of the HIV-l-PR dimer, LES which form in their native 
conformation the (post critical) folding nucleus (FN). It is interesting to note that drug 
induced mutations in the amino acids belonging to these LES (L24I, D30N, L33F, L63V, 



I64V, V77I, V82A, I84D, N88D and L90M 



m 



2 lip, lead to a folding nucleus energy equal 



to -1500 kJ/mol, as compared to -796 kJ/mol for the wild type sequence FN, that is to 
increase of almost a factor of 2 in the stability of the system. 



C. Further evidence 

Wallqvist and coworkers [lo| investigated the HIV-l-PR molecule for the occurence of 
cooperative folding units that exhibit a relatively stronger protection against unfolding than 
other parts of the molecule. Unfolding penalities are calculated forming all possible com- 
binations of interactions between segments of the native conformation and making use of a 
knowledge-based potential. This procedure identifies a folding core in HIV-l-PR comprising 
residues 22-32, 74-78 and 84-91, residues that form a spatially close unit of a helix (84-91) 
with sheet (74-78) above another /3-strand ((22-25), containing the active site residues D25, 
T26 and G27) perpendicular to these elements. 



221 have studied the 



Making use of a Gaussian network model, Bahar and coworkers 
normal modes about the native conformation of HIV-l-PR. "Hot" residues, playing a key 
role in the stability of the protein, are defined as those displaying the fastest modes. In this 
way regions 22-32, 74-78 and 84-91 are identified as those forming the folding core of the 
protein. These regions match with those displaying low experimental Debye-Weller factors, 
that is low fluctuations in the crystallographic structure. 

Calculation of ip-values by means of Go-model simulations performed by Levy and 
coworkers j^] have located a major transition state where only regions 27-35 and 79-87 are 
structured. The protein then reaches the native state, overcoming another minor transition 
state, and subsequently dimerizes into the biologically active structure. 

Cecconi and coworkers have calculated the stability temperatures associated with 
each contact of the protease, again making use of a Go-model. They flnd that key sites to the 
stability of partially folded states, that is those displaying the lowest stability temperatures, 
are 22, 29, 32, 76, 84 and 86. 
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D. Dynamics of contact formation 



To directly investigate the folding mechanism of the HIV-l-PR monomer, use is made of 
a simplified model which has proven useful to account for the folding properties of a number 
of small, single-domain proteins j^]. The model pictures each amino acid as a spherical 
bead, making inextensible links with the rest of the chain. Each pair of amino acid interacts 
through a modified Go-contact potential 3| 

U{{r,}) = J2B^Ai\n-r,\), (1) 

i<j 

where is the coordinate of the ith amino acid, A(|rj — rj\) is a contact function which 
assumes the value 1 if the ith and jth residues are closer than Q.3A and zero otherwise, Bij 
being the energy matrix elements defined in Sect. 2b. Moreover, each amino acid displays 
a hard core of 3.8 A. 

Equilibrium Monte Carlo simulations of the HIV-l-PR monomer provide free energy 
landscapes as a function of the fraction of native energy of the protein. The free energy 
at T = 300K, shown in Fig. El^a) with a solid curve, displays two major wells, corresponding 
to the unfolded and the native states. The grey beads mark conformations met during the 
sampling, and their position along the y-axis indicate the associated RMSD (cf. scale on the 
left y-axis). Note that the native well is quite broad, ranging from qe = 0.75 to qe = 0.95 
and from an RMSD of 3 to 7 A. Also the free energy barrier to the unfolded state is 
rather small, being about 18 kJ/mol (~ 7 kT). These data indicate a two-state transition 
characterized by a weak degree of collectivity. To be noted that in the present calculations we 
only considered the monomeric form of the protein. Dimerization can increase the stability 
and the collectivity of the system. 

Making use of the same model it is possible to run dynamical simulations, starting from 
random conformation and following the folding to the native state. The overall dynamics 
can be followed through the plot of [qE]{t), that is the fractional native energy as a function 
of time, averaged over 100 independent runs. The result is displayed as a solid curve in 
Fig. Efb), indicating an exponential process of characteristic time r = 2.9 ■ 10~^s, consistent 
with the two-well picture. The model provides also information about the formation of each 
contact, through the probability Pij{t) that the contact between residues i and j is formed 
at time t. A number of native contacts are stabilized early (sub-nanosecond time scale) 
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following an exponential dynamics. This is the case for contacts belonging to 5*2 (25-28) 
and in particular to 5*8 (e.g. contacts 86-89, 87-90, 89-92, 90-93). Contacts between residues 
which are far along the chain are formed later, following a non-exponential dynamics, which 
indicate that their formation is dependent on some other event. The earliest involve the 
interaction between fragments — Sj and 5*2 — 5*8 which take place after an average time 
of few and of tens of ns, respectively. As an example we display in Fig. Efb) the formation 
probability of the contact 31-89 as a function of time. 

In Fig. |21^c) is summarized the hierarchy of formation of native contacts of HIV-l-PR, 
the different gray levels corresponding to different time scales, while in Table |n] are listed 
the parameters associated to the selected contacts. The picture that emerges is that local 
contacts within fragments 83-93, between residues 25-28 and in the beta-hairpin 42-58 form 
first. Then the beta-turns 15-18 and 65-70, again built of local residues. The next event is 
the assembly of the nucleus involving fragments 22-34 and 83-93, which is further stabilized 
by the contribution of the strongly-interacting bend 77-83. Finally, the rest of the residues 
come to place. 

Summing up, the model suggests that LES are built of residues which lie in the regions 
83-93 and 22-34. This essentially agrees with the indirect indications provided by the 
studies of Levy and coworkers 123^, which indicate the formation of regions 27-35 and 79-87 
as determinant for the folding of the protein. The stabilization core [23] is somewhat larger 
(cf. Sect. Ill B|l . involving, besides residues 84-93 and 22-34, also the regions 56-66 and 
74-78. 

From the above discussion one can conclude that peptides p-S2 and p-S8 are the most 
likely candidates as folding inhibitors of the HIV-l-PR monomer. Because 5*8 is well struc- 
tured (it builds 10 internal contacts and an a-helix turn), while 5*2 has little internal struc- 
ture, we shall select p-S8 as the best candidate for a non-conventional inhibitor. It is 
expected that peptides p-S8 will efficiently attach to the S'2-LES, thus blocking the forma- 
tion of the folding nucleus or, in the case the protein is already folded, by profiting favourable 
fiuctuations around the native conformation, to come into contact with the S'2-LES thus 
blocking the return of the monomer to the original native conformation. 
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III. INHIBITION OF THE FOLDING OF THE HIV 1 PR 



Within the framework of the Go model discussed above, Monte Carlo simulations were 
carried out for a system composed of a single 99mer and a number (from 1 to 5) of p-Sg 
peptides, placed inside a box of side of 100 A with periodic boundary conditions. 

In Fig. El^a) the resulting free energy of the protein as a function of is displayed. The 
presence of p-Sg peptides has a number of effects: a) it broadens the minimum associated 
with the native conformation, b) it lowers by ~ lOkJ/mol the free energy of the minimum 
associated with the unfolded state, c) it lowers by about the same amount the free energy of 
the maximum of the barrier separating the two minima (transition state). These effects are 
associated with a decrease of the relative population of the native state from 0.98 to 0.78 in 
the case of 3 peptides and to 0.52 in the case of 5 peptides. Because the above results were 
obtained from calculations which describe the system at equilibrium they apply equally well 
to situations in which one starts with the monomer in the state N or in state U. 

To test the validity of these results, we have repeated the simulations using peptides 
corresponding to fragments of the protease monomers different from the LES, but having 
a similar number of amino acids. It is found that e.g., fragments 9-19 and 61-70 do not 
have any significant effect on the free energy of the native state (cf. Fig. Efb)). This result 
testifies to the fact that the denaturing effect observed in the results shown in FigE^a) is 
due not to some unspecific interaction between the peptides and the protein, but to the very 
choice of LES-mimicking peptides. 

To further clarify the mechanism which is at the basis of the results shown in Fig. IHl^a), 
we display in Fig. El^b) a snapshot of the simulation carried out with 3 peptides. It is 
seen that one of the p-Ss peptides has built (strong) contacts stabilizing the folding nucleus 
of the protein. Consequently, the state corresponding to the correctly folded monomer 
(Fig. El^a)) and to the state in which the peptide binds to the partially folded protein have 
similar energies but very different free energies. In fact, the main difference between the two 
conformations displayed in Fig. Ufa) andEfb) is entropic: while the partially folded protein 
has a much larger entropy as compared to the folded protein (which essentially is in a single 
conformation), the complexed state is penalized by the entropy decrease associated with the 
binding of a single peptide, decrease which can be reduced at will by increasing the number 
of peptides. 
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Consequently, whatever the folding entropy cost of binding a p-Ss is, there exists a 
concentration of peptides which makes the complexed, partially-unfolded state favourable. 
In the case of the present simulations, a concentration of peptides equal to few times that 
of the protein seems to be adequate to consistently destabilize the protease. 

We have also analyzed the effect pointlike mutations [35] on the HIV-l-PR have on the 
inhibitng ability of p-Ss peptides. Mutations on sites not belonging to LES have little or 
no effect on the stability of the protease, but at the same time do not affect the inhibitory 
action of p-Sg. For example, mutations on site 73 cause the protease to decrease its stability 
at 300 K from 98% (observed for the "wild-type" sequence) to 97%, the effect of 3 p-Ss 
peptides leading to a decrease of this stability from 97% to 32% (to be compared with the 
98% to 52% decrease observed in the case of the "wild-type" sequence). Mutations in sites 
belonging to the LES of the HIV-l-PR, on the other hand, cause destabilization of the 
native state of the protease, shifting the equilibrium distribution of the order parameter 
towards lower values. For example, mutation in site 34 causes the stability of the protease 
to diminish to 64% (from the value of 98%). 



IV. SPECTROPHOTOMETRIC MEASUREMENTS 



Following the theoretical predictions discussed above, we have investigated the inhibitory 
properties of the peptide of sequence NIIGRNLLTQI {pSs = 83-93) by means of a spec- 

28^ and making use of ultraviolet circular dichroism 



Tophotometric assay as described in ref . 

3- 



A. Materials and methods 



Recombinant HlV-l-Protease, expressed in E. Coli (Sigma Cat. no. P7338) Q, Q 
contained five mutations to restrict autoproteolysis (Q7K, L33I, L36I) and to restrict cys- 
teine thiol oxidation (C67A and C95A). The enzyme was stored at (-70C as solution with 
concentration 25fj,g/63fiL in dilute HCl, (pll=1.6) ). A chromogenic substrate for HIV- 
l-PR (Sigma Cat. no. H5535) (with sequence Arg-Val-Nle-Phe(N02)-Glu-Ala-Nle-NH2) 
was obtained as a 1 mg desiccate, diluted with 0.1 ml of DMSO, and stored at -20C. Pro- 
tease assisted cleavage between the Nle and the Phe(N02) residues of substrate entails a 
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blue-shift of the absorption maximum (277 nm to 272 nm). It can be adequately moni- 
tored observing the continuous decrease of absorbance at 310 nm A regression of the 
absorbance at 300 nm against substrate concentration allows to estimate the extinction co- 
efficient of the whole substrate {S = 6300 ± 1600 {mol cm)~^). Moreover, the absorbance 
at 300 nm after complete peptidolysis allows to determine a differential extinction coefficent 
(= 1500 ± 250 {mol cm)~^) between the whole substrate and the cleaved products. This 
compares well with a difference of extinction coefficient at 310 nm between the cleaved and 
the complete substrate of 1200 ± 100 {mol cm)~^, reported in ref. 28^ . 

Inhibitor peptide (peptide p-Sg) from the primary sequence of the HIV-l-PR wild type 
(PDB code IBVG) were synthesized by solid phase synthesis (Primm, San Raffaele Biomed- 
ical Science Park, Milan) with acetyl and amide as terminal protection group and was 
estimated to be > 95% pure by analytical HPLC after purification. After that 1 mg of 
inhibitor peptide was dissolved in 100 ^\ of DMSO, 4 /il of this solution were then diluted 
with 16 /il of DMSO and 180 /il of the buffer used for assay. The obtained solution (150 /iM 
of peptide p-Sg) was used for the experiments. 

The buffer was prepared by adding 0.8 mM NaCl, 1 mM EDTA and 1 mM dithiothreitol 
to a 20 mM phosphate buffer (pH 6). Other peptides displaying a sequence equal to that 
of selected fragments of the protease, but not belonging to any LES, as well as another 
peptide displaying a random sequence, were also used in the same way as peptide p-Sg. 
The sequences of these control peptides are QILIEICGHK and PLVTIKIGGQL correspond- 
ing to fragments 61-70 and 9-19, respectively, of the HIV-l-PR sequence, as well as the 
LSQETFDLWKLLPEN sequence, which is not related in any way to the protease. We will 
call these peptides Kl, K2 and K3, respectively. To be noted that peptide K2 is rather 
hydrophobic and only > 70% purity could be achieved. 

Each measure was performed recording the absorbance at 300 nm (4.13 eV) in a standard 
UV-vis spectrophotometer (Jasco V-560). The sample had a total volume of 70 fiL in 
Spectrosil Far UV Quartz (170-2700 nm) cuvettes (3.3 mm optical path). The sample in 
the cuvette was exposed to a constant temperature (37 ± 0.05C) provided by continuous 
circulation of water from a waterbath to the cell holder via a circulation pump. 

After proper thermal stabilization of the substrate dissolved in the buffer (pH 6), the ab- 
sorbance at 300 nm was recorded. We performed 6 measures spanning a range of substrate 
concentrations from 50 fiM to 250 fiM. The reaction was initiated by adding 2.78 fig of 
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enzyme and recording the asbsorbance decrease at 300 nm for 1200 sec. In all the reported 
measures the enzyme has thus a nominal concentration of 1.8 fiM. Since, to our knowl- 
edge, no accurate measure has been published of the dimerization constant of HIV-l-PR 
at pH=6.0, it is difficult to estimate precisely the active site concentration in our assay. It 
should however be noted that the active site concentration only affects the estimation of 
kcat, and has no influence over the enzyme- inhibitor dissociation constants. 

The assessment of the enzyme activity in presence of the inhibitor peptides followed a 
similar routine: the enzyme and the peptide (whose concentration are in a ratio 1:3) were 
incubated together for approximately 1 min and then added to the substrate. In all the 
reported measures the final concentration of inhibitor was [Iq] = 5.4 /iM. The asbsorbance 
decrease at 300 nm was recorded for 1200 sec. 

Ultraviolet CD spectra were recorded on a Jasco J-810 spectropolarimeter in nitrogen 
atmosphere at room temperature using 0.1 cm path- length quartz cell. Each spectrum was 
recorded between 260-200 nm. The data were collected at a rate of 10 nm/min with a wave- 
length step of 0.2 nm and a time constant of 2 s. The spectra were corrected with respect to 
the baseline and normalized to the aminoacidic concentration. The protein and the peptide 
were dissolved in a 20 mM phosphate buffer with 0.8 M NaCl at the same concentration used 
for the activity assays. The CD spectra were analyzed in terms of contribution of secondary 
structure elements using the K2D method based on comparison with CD spectra of 
proteins and peptides with known secondary structure. 

B. Results 

In order to provide a quantitative measure of the enzymatic activity of HIV-l-PR and 
the inhibitory mechanism of the designed peptides, we have performed a Michaelis-Menten 
analysis of the data reported in Fig. Efb), measuring the initial reaction velocity vq at 
different substrate concentrations (cf. Fig. |3fa)). We obtain an estimate of the dissociation 
constant Km for the uninhibited-enzyme/substrate complex (curve (2) of Fig. El^b)) of 
670 ± 860 fiM. The maximum velocity of the reaction Vmax results 0.40 ± 0.52 fiM/s, 
corresponding to a catalitic rate kcat = 0.25 ± 0.28s^^. Note that the order of magnitude of 
these results compares well with those reported for the same substrate in ref. ||2S], that is 
Km = 280 ± 100/iM and kcat = 7.3fiM/ s. The apparent dissociation constant i^^,^ of the 
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inhibited reaction is 332 ± 191/iM, the maximum velocity being Vmax = 0.026 ± 0.011/iM/s 
(cf. curve (1) of Fig. Efb)). The fact that the value Vmax changes in presence of the inhibitor 
while that of K'j^j remains within the experimental error of Km suggests that the inhibition 
is compatible with a non- competitive scenario 3^, in which the inhibitor can bind to the 
free enzyme with a dissociation constant Kj or to the substrate-enzyme complex with a 
dissociation constant Kj not significantly different from Kj . The results shown in Fig. Efa) 
lead in fact to the values Kj = 380 ± 810nM and K'j = 180 ± 234nM. The caveats and the 
physical meaning of the interpretation of Fig. Efa) in terms of a classical mixed models will 
be examined in the next section. 

We note that the large errors in the estimate of the kinetic parameters are due to the 
small number of points recorded. An alternative method to determine the value of these 



parameters invo 
Menten kinetic 



ves the fitting of the data with the analytical solution of the Michaelis- 
, solution which can be written in terms of the reduced concentation 
S' = S/Km as: [S']{t) = W{[S'o]exp{-kt + [S'^])), W being Euler's Omega function and 

k = Vmax/ Km- 

Using a nonlinear fit to the data with the above expression one can obtain a more robust 
estimate of both Km and Vmax using the complete time-dependence of the concentration 
of free enzyme, expressed as a function of the measured absorbance as [S](t) = l/[A(t) — 
A{oo)]/6e. In particular, using this method on curve (2) of Fig. Efb), we have been able to 
determine the kinetic values of the protease-substrate reaction obtaining Km = 656±320/^M 
and kcat = 0.49 ±0.08 s^^. These values characterize more accurately the enzymatic process 
than the values obtained from the analysis reported in Fig. \^a). 

We have repeated similar measures using this time the control peptides Kl, K2 and K3 
instead of peptide p-Ss, and found no appreciable variation in the kinetic parameters with 
respect to the uninhibited case (cf. curve (3) and (4) of Fig. El^b)). In the presence of peptide 
Kl or K2 the reaction displayed initial velocities vq = 0.051/iM/s and vq = 0.049/iM/s, 
respectively, essentially identical to the value of vq obtained from the global fit to the curve 
(2) of Fig. Uh). 

The conformational properties of the inhibited protease have been investigated by means 
of UV circular dichroism experiments. The CD spectrum of the protease (cf. Fig. ^ 
under the same conditions used for the activity assay indicate a beta-sheet content of 79%, 
consistent with the beta character of the native conformation The CD spectrum of 
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the solution of protease plus p-Sg inhibitor, at the same concentrations and under the same 
conditions as those of the activity assay, shows a loss of beta-structure (to a beta-sheet 
content value of 45%), indicating that the protein is, to a large extent, in a non-folded 
conformation. These numbers compare well with those predicted by the model calculations 
(cf. caption to Fig. Hj). 



C. Discussion 



The results described above show that the p-Ss peptide inhibits the activity of the HIV- 
1-PR, interefering with the native character of its equilibrium state. Although a Lineweaver- 
Burk plot like that displayed in El^a) is usually interpreted in terms of a non-competitive 
kind of inhibijion, it can also describe the case of a irreversible binding of the inhibitor to 
the enzyme js^. In fact, in a reaction of the kind (where / =p-S8) 

E + S ^ ES P 

k-s 

+ (2) 

k+i 

I ^ EI 

k-i 

where the rates k^j and k_j are much smaller than k^s aiid k_s, one can consider the 
concentration [EI] as constant on a time scale smaller than (A;_/)^^, and given by [EI] = 
{k^j/k-^-i){[E]o — [EI])[I]q. In this context, irreversibility means that on the time scale 
relevant for the experiment the dissociation of EI into E and I is negligible. The concentration 
of enzyme available to the equilibrium with substrate is then, on this time scale, [E]q — [EI]. 
This leads to a Michaelis-Menten equation identical to that associated with non-competitive 
inhibition, of the form 



[S]o[E]o [l + l^^y k^ 



^cat 

= —wiTfif 

where we have set fcj = k^j/k^j . Note that the difference between the irreversible scenario 
and the reversible competitive case is that, in the former, [EI] is proportional to [EO] while 
in the latter [EI] is proportional to the free enzyme concentration [E]. Such irreversible 
scenario is not unexpected in the case of folding inhibitors, where the inhibitor binds tightly 
to the enzyme (as tight as the interactions which stabilize the protein), the unfolding rate 
being of the order of lO^^s"^ js^, giving rise to a value of {k — I)^^ of the order of minutes. 
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V. CONCLUSIONS 



The peptide p-Sg (=83N, 841, 851, 86G, 87R, 88N, 89L, 90L, 91T, 92Q, 931) displaying 
a sequence identical to that of the Sg LES of each of the two identical chains forming the 
HIV-l-PR homodimer is found to be a highly specific and likely quite efficient inhibitor 
{Kj = 380 ± 810 nM) of the folding of the 99mers, and thus of the whole enzyme. A 
remarkable property of this inhibitor is that it is unlikely that it would allow for escape 
mutants. In fact, the only mutations which will prevent p-Sg from acting are expected to 
involve protein denaturation. 

Obvious disadvantages of the present design are the length and the peptidic character of 
the inhibitor, because it is not clear how to prevent the degradation by cellular enzymes. 
Consequently, there are two clear tasks lying ahead. One, to investigate whether the short- 
ening of p-Sg, by leaving out some residues either at the beginning or at the end (or both), 
lead to peptides which still inhibit folding with similar specificity and effectiveness as p-Sg 
does. The second is to develop molecules mimetic to p-Sg or eventually to shorter peptides 
derived from p-Sg. 

We conclude by suggesting that the strategy employed to design p-Sg, being universal, 
can be used to design inhibitors of the folding of other target proteins. 
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TABLE I: The all atom molecular dynamics energy map of the eight amino acid chain segments 
with which one can represent in an economic fashion each of the two HIV-l-PR monomers. 
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FIG. 1: a) The entropy per site of proteins structurally similar to the HIV-l-PR monomer (pdb 
code: IBVG). The solid line indicates the entropy function calculated over 28 proteins displaying 
sequence similarity lower than 25% with the HIV-l-PR, while the dashed line is associated with 
462 proteins irrespective of sequence similarity, b) The components of the eigenvector associated 
with the lowest eigenvalue of the interaction matrix between amino acids 20] . 
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FIG. 2: a) The free energy of HIV-l-PR monomer as a function of the relative energy (order 
parameter) qe (black curve) and the values of RMSD sampled during the simulation (gray dots) at 
T = 300K; b) The formation probability of the native conformation (solid curve), the probability 
P87_9o(t) (dashed gray curve) and the probability P3i-89(i) (dotted gray curve), displayed as a 
function of time; c) Contact map of the HIV-l-PR, the different gray levels being associated 
with to different values of the average stabilization time. Darker symbols are formed in 10~^'^s 
while lighter symbols in 10~^s. The dynamics is simulated by mean of a dynamical Monte Carlo 
algorithm and the discrete time step is expressed in terms of time units by setting the simulated 
diffusion constant of the protein equal to 10~^m/s^. 
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FIG. 3: The free energy profile of the HIV-l-PR monomer (sohd curve) and of the system composed 
of the monomer plus 3 (dashed) and 5 (dotted) p-Sg peptides, as a function of the relative native 
energy qe- The temperature at which the simulations were carried out is 300 K. b) The free energy 
profile of the monomer alone (solid curve) and of the system composed of 3 peptides corresponding 
to the fragments 9-19 (dotted curve) and 61-70 (dashed curve). 
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FIG. 4: a) Native conformation of each monomer which build the HIV-l-PR dimer, based on 



the NMR conformation of ref. [l^. b) Snapshot of the samphng of the system composed of the 
protease and 3 p-Sg peptides. The protein has been drawn in hght grey and the S2 and Sg LES 
exphcitely marked in dark grey. The p-Sg peptides are shown in black. The ratio of contacts 
forming beta structure is 56% for the native conformation shown in (c), and decreases to 43% with 
3 peptides (cf. (d)) and to 29% with 5 peptides. 
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FIG. 5: a) The Lineweaver-Burk plot associated with the protease (empty circles) and the protease 
complexed with the inhibitor p-Sg (filled squares). The dashed lines are the linear fits to the 
experimental points. The initial velocities vq are expressed in terms of absorbance/s, while the 
substrate concentration \S\ is in mM. b) The enzymatic kinetics of the inhibited protease (1), of 
the protease alone (2), of the protease together with control peptides Kl (3) and K2 (4) , measured 
as change in absorbance of the chromogenic substrate as a function of time. All the curves have 
been measured at [5o] = 125mM, and have been shifted along the y-axis in order to be easily 
inspected. 
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FIG. 6: The circular dichroism spectrum of the protease (sohd curve) and of the solution composed 
of the protease and p-Ss peptide (dashed curve) in the ratio 1:3. 



22 



